Appendix D 



I Title: 

1 BPI Specification 
I Draft Version 0.9 



1. General 

This document describes how to use the Bus-Peripheral-Interface. In the current 
version, this document is about BPI for FPI bus only. 

2. Function 

2.1. General concept 

A Bus-Peripheral Interface BPI completely handles the bus protocol of the chip internal 
interconnect bus; this BPI can therefore be seen as the standardized interface to the 
FPI-Bus. 

On the other side all BPIs adhere to the fundamentals given in "Platform Concept: 
SMIF Specification" (T. Steinecke, P. Schneider) and considered binding for all platform 
peripheral. 

The peripheral's functional code (named "kernel" throughout this document) is thus / 
independent of the used bus protocol. 

The main tasks of the BPI are 

• Buffering To define a predictable bus timing some standardisation on fan out 

and fan in is mandatory 

• Decoding As Kernel SFR's will have different addresses in different uC's, 

addresses are decoded inside BPI mainly. 

• Handshaking Bus and peripheral may use different clock rates (even dynamically 

!). Thus some way of synchronization has to be provided. 

• Error Handling Access to undefined locations and unclocked peripherals are func- 

tions not regarded legal under normal circumstances; some error 
notification via the bus may be needed. 

• Test Mode If special test modes are necessary the should be implemented 

likewise throughout a chip. 
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2.2. I/O signals from BPIJNTERFACE 



FPI - BUS 



FPLD.EN 

FPI RDY I 



FPI.RDY.O 

FPI_EN ^ 
FPI_RD_N — 
FPI_WR_N — 
FPI_SEL_N — 
FPLOPC 
FPLACK 
FPLTOUT 
FPLABORT.N 
XXX_CLK_ENJ 
XXX.BUS.FASTER 
BUS_CLK.ENJ 
HIGHJ4WORDJ 
PDFT.SCAN.MODEJ 
OCDS_P_SUSPENDJ 
XXX_EX_DISRJ 
XXX CLK ON I 





BUS- 



PERIPHERAL- 



INTERFACE 



SMIF - SIGNALS 

BPLDATA.O[31/15...0] 

BPI_DATAJ[31/15 ... 0] 

BPLWR_SFR_N[X ... 0] 

BPI _WR_BY_N[3/1 ... 0] 

BPI _RD_ SFR_N[X ... 0] 

BPI_RD_BY_N[3/1 ... 0] 

BPI.ABORT.N 

BPI_REQ_N 

BPLRDY 

BPI_A 

BPI.CS.N 

BPLRD.N 

BPLWR.N 

BPLPROTECT.MASK 

BPLACC^N 

KERNEL.ERRJ 

XXX_DlS_N_0 

GENERAL NOT USED 

XXX_GATING_EN_0 

BUS_GATING_EN_0 

XXX_DISREQ_N_0 

XXX DISACK N I 



Page 3 of 36 




The FPI signals are described in the in the FPI-Specification. 



Description of the interface ports to the SMIF: 



BUS.CLK 


Corresponds to the fpi_clk 


fpU"dy_i 


Corresponds to the fpi_rdy signal (fpi_rdy_b) from the FPI 


fpi_rdy_o 


Ready acknowlege from the BPI. Driven to the signal 
fni rdv b if enabled with the fDi en o sianal 


fpi_en_o 


Enable signal for the fpi_rdy_o and fpLack_o tristate 
driver 


ftai ack o 


Acknowleae sianal from the BPI Driven to the sianal 
fpLack(_b) if enabled with the fpi_en_o signal. 

Note: Must be INOUT for synopsys test compiler 


fpi_d_i 


Corresponds to the fpi_d from the FPI 


fpLd_o 


Driven by the BPI internal data register. Driven to 
fpLd(_b) if enabled with the fpi_d_en_o signal. 


fpi_d_en_o 


Enable signal for the fpi_d_o tristate driver 


BPI DATA I[31/15...0] 


Input data bus, either 32 or 16 bits wide. 
Source: Peripheral kernel port kernel_data_o 


BPI_DATA_O[31/15...0] 


Output data bus, either 32 or 1 6 bits wide. 
Destination: Peripheral kernel port kernel_data_i 


BPI_RD_SFR(x...O) 


Dedicated read signals, one each for each SFR. 


BPI_PxD_BY_N[3...0] 
(optional) 


Selection which bytes have to be read 


BPI _WR_ SFR(x...O) 


Dedicated write signals, one each for each SFR. 


BPI_WR_BY_N[3...0] 


Selection which bytes have to be written 


BPI_PROTECT_MASK 
[31/15...0] (optional) 


Mask for bit-protection. 

,0* means that the BIT must not be changed 

,1' means that the BIT must be changed in the peripheral. 


BPI_ABORT_N 
(optional) 


This signal is directly mapped to the FPI-BUS-signal 
FPI_ABORT_N. Each peripheral can abort any access in 

this way. In the current version this signal is not used by the inter- 
face!! This is wrong and will be resolved in the next version 


BPI_REQ_N 
(optional) 


Indicates a valid access if dynamic waitstate insertion is 
required (handshake_c=1). This signal is reset by 
BPI.RDY. 
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BPIJtDY 
(optional) 


Ready indication from the kernel after a dynamic waitstate 
insertion. While this signal is driven ,0', it is not possible to 
request a new access with the signal BPLREQ_N. 
Exactly one waitstate is inserted if BPLRDY is directly 
bridged to BPI REQ N in the kernel 


BPI A (optional) 


Synchronized address bits (latched FPLA) for the RAM- 
Interface. Only valid if a RAM-Interface is configured 


BPI_CS JSf (optional) 


Chip select signal for the RAM-Interface 


BPI_RD_N (optional) 


Read signal for the RAM-Interface 


BPLWRN (optional) 


Write signal for the RAM-Interface 


BPI_ACC_N 


This signal indicates to the kernel when a Write or Read 
access will take place. It is active for the period of one 
"Master clock" before and durina the risina edae of the 
peripheral clock. With this signal, multiple reads or writes 
to a peripheral register can be prevented. A normal read 
cycle doesn't need this signal, but if a register performes 
destructive read accesses this signal must be considered. 


XXX_BUS_FASTER 


Notifies the BPI module of a peripheral that the bus clock 
rate is currently higher than the peripheral clock rate. 
This speed indication is needed to perform zero Waitstate 
writp arrp^es to the oeriDheral's internal reoisters when- 
ever the Bus clock rate is equal to or slower than the 
peripheral clock rate. 

Connecting XXX_BUS_FASTER to static T will insert 
exactly one waitstate into each access. 


KERNEL.ERR 


Error signal from the Peripheral Core to the BPI module 
signalling an error condition during a not allowed RD/WR 
access from BPI module to the peripheral core. 
Must be connected to ,0* if not used. 

In the current version this signal needs to be driven before an access 
will be performed. This is not possible if the peripheral clock is turned 
off I 


OCDS_P_SUSPEND 


Notifies the peripheral to stop the peripheral clock for 
debuqqinq purposes. 


xxx_ex_disr 


Requests the peripheral to stop the peripheral clock. 


Bus_clk_en 


Enable signal for the BPI component clock domain. Is 
used for clock gating. 


xxx_clk_en 


Enable signal for the peripheral kernel clock domain. Is 
used for clock gating. 


Bus_gating_en 


Enable signal for clock gating of the peripheral clock 
domain. 
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xxx_gaung_en 


tnauie signal Tor uiuuf\ ycuiiiy ui ine pci ipi lciai uiuuiv 
domain. 


xxx_clk_on 


Special function pin for RTC. Must be connected to ,1' if 
not used! 


xxx_dis_n_o 


Special function pin for RTC. Leave open if not used. 


xxx_disreq_n 


Disable request signal to the kernel to switch off the 
peripheral clock. 


xxx_disack_n 


Disable acknowledge signal from the peripheral kernel to 
allow switching off of the peripheral clock. Bridge this sig- 
nal to xxx_disreq_n if no specific function is implemented 
in the kernel 


high_hword 


This signal is only considered during write accesses if the fpi data bus 
width is 32 bits and the kernel data bus width is 16 bits. 

High_hword = n 0" means that the lower 16 bits of the fpi 
bus carry valid data and are to be mapped to the kernel 
data bus. 

High_hword = J" means that the upper 1 6 bits of the fpi 
bus carry valid data and are to be mapped to the kernel 
data bus. 



2.3. Architecture of the whole peripheral and testbench 

A peripheral is a hierarchical construction (see Figure 2 below). 

The peripheral kernel can be given either as a structural or as a rtl description. 

The components <project>_bpi and <project>_kernel are contained within the module 

<project>_syn. (syn means synthesis unit) 

Example: p3_wdt_bpi and p3_wdt_kernel comprise p3^wdt_syn. 

On this project level, no tristate drivers are included, they are instantiated in module 
<project>_bus_driver. 

Example: p3_wdt_bus_driver. 

For the distribution of the master clock to the BPI and its kernel a special clock driver 
each will be instantiatied inside module <project>_clock_gating. 

Example: p3_wdt_clock_gating. 

The three modules <project>_bus_driver, <project> - clock_gating and <project> _syn 
are contained inside <project> 

The dedicated testbench (<project>_tb) for this toplevel module consist of the periph- 
eral itself (Example: p3_wdt)and the test units FPI (bus model).and <project>JOCJop 
(input output control). 
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Example: p3_wdt_iocJop. 

The figure below shows the internal hierarchy of the bpLinterface and the other components. 

Remark: All VHDL units should have a unique prefix in their name to signal their belonging to 
a certain peripheral. p3_xxx is used throughout this document as generic peripheral name. 
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Figure 1 : BPI architecture 
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<project>_tb 



r 



<project> 



CLOCK- 
MODEL 
(included 



FPI-BUS- 
MODEL 



f~ <project>_j 



syn 



<project>_ 
cloclc_galing 



<project>_ 
bus_drivcr 



<project>_bpi 



SMIF 



<projcct>Jccmel 



test-register 
(optional) 



I 



J 



extern 
input 



IOC-Model 



extern 
outputs 



I : 

Figure 2: Architecture of the whole peripheral and testbench 
If you use this example proceed as follows: 



j 



j 



• Complete the component <project> and<project>_syn with the external input and 
output signals given by your<project>_kernel. 

• Extend the signal list and modify the port map in file <project>_tbe_tba-a.vhd. 

• Change the constant nr_of_pins_c and edit the constant pin map of the 
<project>_IOC_top port map 

The <project>_tbe_tba-a.vhd contains an example. 

3. Parameter setting in the project package 

The configuration of the BPI is handled in the BPLPACKAGE, contained in the file 
<project>_bpi_pack-p.vhd. 

For every peripheral this package needs to be copied into the project packages direc- 
tory under a unique name, usually by prefixing the original name with the project pre- 
fix, e.g. P3_XXX.BPLPACK-RVHD. Configuration of the BPI is then accomplished by 
editing (extending) this file. One important point is editing the name of the package 
likewise ! 

Example for the watchdogtimer (WDT): 

$HWPROJECTA/HDL_PACKS/RTUP3_WDT_BPLPACK-RVHD. 



The personalised project package is then imported by all other units with the state- 
ment 
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USE W0RK.P3_XXX_BPI_PACK.ALL; 
Inside this package several constants are to be checked and changed if necessary: 



fpi_addressbuswidth_c 


Indicates the width of the incoming FPI_A - BUS 
Preset to 32 Bit. Normally no change necessary 


fpi_databuswidth_c 


Indicates the width of the bidirectional FPI_D - BUS 
Preset to 32 Bit. Normally no change necessary 


kemel_databuswidth_c 


Indicates how many bits are provided by the largest 
peripheral register in one access. Possible values are 32, 
16, 8. 

The constant sfr_size_c indicates the numbers of bytes 
and is calculated from kernel_databuswidth_c by: 
sfr_size_c := kernel_databuswidth_c / 8 


addressbuswidth_c 


The constant addressbuswidth_c indicates how many bits 
are to be decoded by the address decoder for the SFR 
select signals. 

In order to get the correct address of the register the bits 
fpLa(fpi_high_c downto fpi_low_c) will be decoded. 
Note that fpi_a(1 downto 0) are reserved for the byte 
selection. 

The default value is 8; {(7 downto 0)} 


fpi_Jow_c 


LSB of the decoded address for the SFR addressrange. 
Preset to 2. Normally no change necessary 


fpi_Jiigh_c 


MSB of the decoded address for the SFR addressrange. 
Preset to 7. Normally no change necessary 


handshake_c 


When you need a handshake-procedure between inter- 
face and kernel, set the constant handshake_c:=1; 
Preset to 0 


fpi_addr_width_c 


Range for bit_vector. 

Preset to 8. No change necessary unless the constant 
addressbuswidth_c was changed. 

This constant is only used to define the type of the follow- 
ing constant sfr_addr - c. 


sfr__addr_c 


Array of SFR-Addresses for the address decoder. 

Note that the first 4 addresses (x , 00 I x'04 f x 1 08,x ( 0C) are 

predefined ! 


destructive_read_c 


Indicates whether the peripheral includes destructive read 
registers. Preset to 0 
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dest — addr_c 


Subarray of SFR-Addresses (must also be present in 
sfr addr c) which are destructive read registers. Preset to 
.10". 

This constant is only relevant if destructive_read_c = 1 . 
If only one register is destread insert it twice 


The next group of constants is relevant if a RAM-interface is required only. 


ram_interface_c 


Gobal indication if a RAM is required! Preset to FALSE 


ram_addr_c 


The constant indicates the bit combination which shall 
create the select signal for the ram inside the kernel. 
Preset to „T This constant is compared to 

fpLa(ram_high_c downto ramJow_c) 
If ram_high_c is not equal to ram Jow_c define a subtype ! 


ram_addr_low_c 


indicates tne isd ot me rv\ivi aouress. rresei \o u 


ram_addr_Jiigh_c 


Indicates the msb of the RAM address. Preset to 7 
bpLa<=fpi_a(ram_addr_high_c downto ram_addrJow_c) 
Preset to bpi_a <= fpLa(7 downto 0) 


ram_lo\v_c 


Lob ot the cecoaeo ram aaaress. 

Preset to 8. Normally no change necessary 


ram_high_c 


Preset to 8 ; only 256 byte RAM 
If you set this constant, for example to the value 9 then 
don't forget to change the constant ram_addr_c to a 2-Bit- 
Vector. Example: 

std_ulogic_vector(ram_high_c downto ramJow_c) := JO" 
This sector indicates if a RAM or SFR are selected. 


ram_databuswidth_c 


Preset to 32 - the same as the FPI data bus. You can 
change it to 16 or 8 for another ram databuswidth. 

11 IN THE CURRENT VERSION III! 

1! ram_databuswidth_c must equal kernel_databuswidth_cl 
!! It is no own data input for the RAM inserted 11 


ram_access_c 


The result of the calculation ram_databuswidth_c / 8 



If a RAM-module is not configured (ramjnterface_c = FALSE), the RAM special signals 
must not be used in the kernel. These are 

BPI.AJ the registered fpLa(ram_low_c-1 downto 0) 

BPI_WR_N J write signal for the RAM 

BPI_RD_NJ read signal for the RAM 

BPLCS_N J chipselect signal for the RAM 
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4. Binary and Enhanced Mode 

For compatibility reasons some bus mapping is provided. This is controlled by the spe- 
cial signal high_hword. Only if the FPI databus width is 32 and the Kernel data- 
buswidth is 16 is it possible to multiplex the upper or the lower16 Bit of the FPI databus 
to the kernel databus independently from the Isbs of fpi_a ! Nomally this signal must 
be held '0'. 



fpi_d_i(31 .. 16) 



fx 



fpi_d_i(0 ..15) 



bpi_data_o(0 .. 15) 




0 



high_hword 



fpi_d_o(31 .. 16)-*- 
fpi_d_o(0 .. 15) 



bpi_data_i(0 ..15) 
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5. The FSM of the controlunit 



5.1. control_m1 

read_fast <= xxx_bus_faster_i='0' and fpi_rd_n_i='0' and fpLopcj /= JUT and 
fpL.seLnj-0' and fpi_rdy_i=T bpLseLnJ='0'; 

write_fast <= xxx.bus.fasterJ-O* and fpi_rd_nJ=T and fpLopcj /= J 111" and 
fpLseLnj-O' and fpLrdy_i=T bpL$eLn_i='0' and fpi_wr_nj- 0'; 

read_slow <= (xxx_bus_faster_i= t O < and fpLrd_n_i-0' and fpLopc_i /= J 11 1" and 
fpLsel_n_i= t O < and fpi_rdyj=T bpi_seLn_i- 0') 
or (read_fast and bpi_desU=T; 

write.slow <= xxx_bus_faster_i- 0' and fpi_rd_n_i=T and fpLopcj /= J 1 1 1" and 
fpLseL.nJ-0 1 and fpLrdy_i= l 1* bpLseLn_i= t O t and fpLwr_n_i= < 0 < ; 

error <= fpi_tout_i=T or kernel_errj- 1 '; 



error or else 




write_fast 
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5.2. controI_h ( handshake) 

read <= fpLrd_n_i='0' and fpi_opcJ /= J 1 11" and 

fpLsel_n_i='0' and fpi_rdyj=T bpLsel_n_i= l O i 

write <= fpLrd_n_i= l 1 ' and fpi_wr_nj- 0' and fpi_opc_i /= „1 1 1 1" and 
fpLsel_nj- 0* and fpi_rdy_i=T bpLseL.nj-0' 



error or else 




5.3. FSM for error handling 
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An error occured if 

• the peripheral is selected but the SFR address is wrong, 

• a access is performed while the kernel clock is switched off or 

• a access is performed while the kernel indicates a kernel error 

In the e_read state the fpi data bus nonetheless must driven by the peripheral. 

If the kernel clock is switched off only accesses to the clock control register is allowed. 

In case of timing problems there is a constant error_handling_C defined inside the architecture 
p3_xxx_bpi-rtl-a.vhd to switch off this errorhandling (constant error_handling_c: boolean:=false;)! 
In this case the fpi_ack is in everytime NSC! 



read error 



read error 



read error 



write error 




write error 



5.4. The read modify write FSM 

The rmodw FMS rememberes the read part of a modify access and waites for the corresponding 



write access. 



read and 
fpLwr_nj-0' 



else (read) 




waitstates 
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6. I/O signals from Peripheral Kernel 



<PERIPHERAL> - KERNEL 


XXX_RESET_NJ ► 






XXX_CLK_I ^ 




— KERNEL_DATA_O[31/15...0] 


KERNEL_DATAJ[ 31/15 ... 0] — 
BPI_WR_SFR__N_I[ x ...0] ^ 




^ BPI_RDY_0 


BPI _WR_BY_NJ[ 3/1/0 ... 0] ^ 




^ KERNEL.ERRJ) 


BPI RD SFR N ir x 01 — 




— ► xxx_disack_n_o 


BPI_RD_BY_N_I[ 3/1/0 ...0] ^ 






BPLABORT.NJ 






RPT RPD NT T ^ 






BPI_AJ ► 






BPLCS.NJ ^ 






BPI RD N I ► 






BPI,WR_N_I ► 






BPI_PROTECT_MASKJ ^ 






BPI_ACC_NJ ^ 






xxx_disreq_nj ► 






PERIPHERAL - ^ 




^ PERIPHERAL - 


SPECIAL ► 




► SPECIAL 


INPUTS ^ 




^ OUTPUTS 







The signals have already been described above. 

If no error indication from the kernel is required, two signals must be driven nonethe- 
less by the architecture of the kernel: 

kerneI_err_o <= ,0*; - special error from kernel 
xxx_disack_n_o <= xxx_disreq_n_i; - always ok to disable clock 
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7. How you can use the port signals 



7.1. Write enable logic 

A register normally consist of one, two or four bytes. In order to write each byte sepa- 
rately the BPI provides a dedicated enable signal (BPI_WR_SFR_NJ(x)) for each SFR 
and a separate enable signal (BPI_WR_BY_NJ(3..0)) for each byte of the data bus. 
For each byte of each register the write enable will be generated from this signals. 
If the signal BPLWR_SFR_NJ of the SFR and all signals of the BPI_WR_BY_NJ are 
low, then the whole register is enabled to store the new data. 

The figure below illustrates the proposed circuit 









BPLWR_SFR_NJ(4) 


enable- 
logic 


write enable for Byte 3, SFR 4 


BPLWRJBY_NJ(3) 


^ 




BPI_WR_SFR_NJ(4) 






enable- 
logic 


write enable for Byte 2, SFR 4 


BPLWR_BY_NJ(2) 


: ► 




BPI_WR_SFR_NJ(4) 






enable- 
logic 


write enable for Byte 1, SFR 4 


BPI_WR_BY_NJ(1) 


^ 




BPI_WR_SFR_NJ(4) 






enable- 
logic 


write enable for Byte 0, SFR 4 
: ► 


BPI_WR_BY_N_I(0) _ 







Caution!! 

The BPLWR_SFR_NJ[0 3] and BPI_RD_SFR_NJ[0 .. 3] are reserved normally !! 

SFR(O) is the clock control register and is instantiated in the BPI. 

SFR(1) is the peripheral input select register 

SFR(2) is the identification number 

SFR(3) is reserved for future use 
The correlation between index and address is given by the order of the addresses in 
the addresslist in the bpLpackage. 

Note that only SFR(O) is implemented inside the BPPs architectures (clc-entity) 



i 
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32 BIT REGISTER 



KERNEL_DATAJ[31..24] ^ 
write enable for Byte 3, SFR 4 


Byte 3 
Byte 2 

Byte 1 

ByteO 








SFR(O) data out 


► 

KERNEL_DATA_I[23...16] ^ 

write enable for Byte 2, SFR 4 
— : : ► 




KE RNE L_D ATA_I [ 1 5 . . 8 ] ^ 
write enable for Byte 1, SFR 4 






► 

KERNELJDATAJ[7 " 0] ^ 
write enable for Byte 0, SFR 4 







Page 18 of 36 



7.2. Multiplex output data 



The next figure shows one possibility to propagate the output data of the kernel. Note 
that multiplexing RAM data and SFR data in two stages is only an example. 



SFR(l)data out 



SFR(2) data out 



SFR(4) data out 



I 

SFR(n) data out 



MUX 



BPI_RD_SFRJSLI(n... 1) 



KERNEL_DATAJ[3 1/15...0] 

BPI.AJ 

BPI_CS_N_I 
BPI_RD_N_I 
BPLWR_NJ 
BPI ACC N I 



SFRJDATA[31/15...0] 




RAM-Module 



KERNEL_DATA_O[31/I5...0] 



BPI CS_N_I 



RAM_DATA[31/15...0] 
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7.3. Read modify write cycle and protect mask 



Read-Modify-Write is coded on the FPI bus and recognized by the BPI interface. The 
read modify write signal is not forwarded to the kernel. 

As a result a signal RMODW_N_S is low active for the complete read modify write 
cycle on the FPI-BUS. As no other SFR may be read intermittingly the read data can 
be stored in the read register. A protection mask can now be derived by comparing the 
(locally stored) read data with the new data to be written. All bits not changed are 
given a mask value 0 if the RMODW_N_S signal is active. 
In a normal write or read cycle, all bits of the mask are J". 

Protect mask bit equal to J" means that the data bit must be written into the register. 
Protect mask bit equal to „0 U means that the data bit must not be written into the regis- 
ter. 

This is illustrated by the next figures. 



bin- or enh-mode 




fpi_d_en_n_o 



bpi_data_o 
1011 



^ bpi_protect_mask_o 

0001 in the rmodw-cycle 
otherwise 1111 



bpi_data_i 



clock 



rmodw n_s 



read enable 



The bpLdataj are only captured at the end of the read-cycle and remain unchanged 
by the write- and idle-cycles on the FPI bus. The protect mask is provided every time. 
Only register bits which can be modified by SW and HW need to use the protection 
. mask. 
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clock 

address 

read 

write 

rmodw 

fpi_rdy 

fpi_d (bpi_data_o) 
bpi_protect_mask 
register contents 
hardware action 
software action 





















Al 


Al 


A2 


A3 , 


A3, 


A4 


A5 



-uj — I 



1010 



1011 





0000 


0001 . 




i 




t 


1010 


1101 


1111 


1 1 

iread 1 


writel | 
writel i 



I. I 

The two figures below illustrate how to use these signals 



KERNEL_DATA_I[7 . .0] 



write enable for Byte 0, SFRA 



BPI_PROTECT_MASK_I[7^ 



enable logic 
forBit(7) 



Bit(7) 



enable logic 
forBit(O) 



Bit(7) 



EXTERN JDATAJ 
KERNEL_DATA_I 



BPIJ>ROTECT_MASK_I[7..0] 



intern write enable(BPI) 
extern write enable 



enable logi 
for Bit or 
Byte 




DATA_0 
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8. Peripheral Address Map 



Each peripheral has n x 256 Byte register blocks or n x 64 32-Bit register. The register is 
selected with signals BPI_WR_SFR_N J or BPI_RD_SFR_N_I . Four registers are fixed: 

• Address OOH is the peripheral clock control register. 

• Address 04H is the peripheral port input select register. 

• Address 08H is the peripheral identification. No real register, hard coded. 

• Address OCH is reserved. 

Control registers start at address 10H, followed by the data register. 
The interrupt control register start at the address FFH decreasing order. 



FFH 



10H 
OCH 
08H 
04H 
OOH 




Interrupt Control 
Register 



Data 
Register 
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9. Usage of the fixed Registers 

9.1. XXXID -REGISTER 

The XXXID is not a real register, it is hard coded and readable only. 
Example : 

perid.s <= .101 001 11"; 
or 

perid_c : std_ulogic_vector(8 downto 0) := J 01 001 1 1"; 
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It is selected with bpLrd_sfr_n_i(2). 

It should be assured by appropriate donMouch attributes that the ID is implemented 
on silicon as matel contacts. That allows cange of revision for metal redesigns. 



9.2. XXXPISEL - REGISTER 

The XXXPISEL register indicates which port shall be used for kernel inputs. 

The register has to be implemented by the peripheral designer and is selected with 

bpLwr_sfr_n_i(1). Every bit in this register controls one multiplexer. 

Therefore, every external input of the kernel can be driven by one of two sources. 

Example : 

If XXXPISEL(O) = ,0' then kernel signal <= Port_0 else kernel signal <= Port_1; 



Port_rN_0 



PortJNJ 




1 



9.3. XXXCLC - REGISTER 

The state of the peripheral clock is controlled by a register bit "XXXDISR" located in 
the register XXXCLC of the peripheral core. This register XXXCLC is clocked with the 
bus clock to be able to switch the peripheral clock on again if it was off. If required by 
the peripheral's functionality switching off the clock can be prevented by the periph- 
eral. The actual clock state will be shown by the state bit "XXXDIS" within the same 
register. The signal U XXX_DIS_N" which is a combinatorial combination of other clock 
control signals represents the actual enable state of the peripheral clock and is used 
to switch on/off the peripheral clock. The clock gating buffer is located in the Clock 
Gating block of a peripheral. The clock enable signals used to control the clock 
speeds are generated in the central Clock Generation and distributed to the Clock 
Gating block. 

The BPI module rejects every FPI bus access with Error condition if the peripheral 
clock is switched off (signaled by the signal Kernel_err_o- driven by clc module, not 
peripheral kernel). 

9.3.1. Coherence between XXXCLC and the OCDS_P_SUSPEND, XXX_EX_DISR 

For On Chip Debugging support an additional signal OCDS_P_SUSPEND is intro- 
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duced to stop the peripheral clock for debugging if this function is enabled. If debug- 
ging mode is active the peripheral core rejects write access to registers connected to 
the peripheral clock by activating the signal Kernel_err. This causes the BPI module to 
reject FPI bus accesses with Error condition. Read accesses to registers of the 
peripheral clock domain are possible. 

To be compatible with old products an XXX_EX_DISR signal is introduced to disable 
the peripheral clock. 

For more information see also the Peripheral Clock Strategy Specification 



XXXCLC - REGISTER 
XXXDISR 



(OL 



KERNEL_DATA_I[7..0] 



OCDSJLSUSPENDJ 
XXX_EX_DISR I 



BitO 



(21 



Bitl 



XXXDIS 



Bit2 



Bit3 



GPSEN 



XXX_EX_DIS 



[> 



OCDS DISR 



Lo<l- 



<project>_kemel 
xxx_disack_n_o ^ xxx_disreq_n_i 

kernel depended or 
direct connection 



BPI_SFR__RD_NJ 
BPI SFR WR N I 



KERNEL_ERR_0 
to the BPI control 



if a read or 
write access 
is atowed 
no error 



ifa=0 , 
and b» Fals^" 
1 

else y 0 



y and b» Fa 
|~ then y <=» 
J else v <- 



ifa=0 
and b*» FaU( 
then y <- 1 
else y <= 0 



clc data output is Bit(3 downto 0) 




ir u w 



XXX_DIS_0 
-o ► 



KERNEL.ERRJ 
from the kernel 



NOR-gatO 



1 



0 


0 


T 


T 


1 


0 


T 


T 


0 


1 


F 


F 


1 


1 


F 





WREN 



RD_EN 



Page 24 of 36 




10. Forwarding 



If a read access immediately follows a write cycle, it is normally not possible to read 
the previously written data in a zero waitstate access due to the required synchronisa- 
tion. To resolve this problem no forwarding multiplexer is provided; instead a waitstate 
will be inserted if a read access is performed after a write access to the same register 
and without any idlestates in between ! 

11. Timing Diagrams 

As there is no fixed relationship between bus clock and peripheral clock several differ- 
ent timing diagrams are provided. 

The first possibility is, that the bus clock is faster (bus_faster is ,1') as the peripheral 
clock. 

The second possibility is, that the bus clock is slower tahn or exactly as fast as the 
peripheral clock (bus_faster is ,0 l ). 

If it's necessary to implement a RAM inside the kernel the cycle must start with one waitstate. 
This is not implemented! 

The BPLACC_N_I signal is used to avoid multiply writes to the same address due to 
to fast a peripheral clock. It will be driven low activ one clockcycle before the relevant 
rising edge of the peripheral clock. It is activated whenever a register must store the 
data in the write cycle or change the data after a destructive read cycle. 
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BUS CLK FASTER THAN XXX CLK 
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BUS CLK FASTER THAN XXX CLK 
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BUS CLK SLOWER OR EQUAL THAN XXX CLK 
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BUS CLK SLOWER OR EQUAL THAN XXX CLK 
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HANDSHAKE BETWEEN INTERFACE AND KERNEL 
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RAM INTERFACE (not implemented) 

The current version is like a noma) access and provides the RAM signal but not in the timing below 
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RAM INTERFACE - BUS SLOWER THAN KERNEL 



MASTER.CLK 

BUS.CLK 

FPLADDRESS 

FPI.DATA 

FPI.RDY 

SLOW CLK EN N 



n. 



Al 




A3 








Dl 







XXX_CLK 

KERNEL.DATAJ 

BPI_AJ 

BPI_CS_NJ 

BPI_WR_NJ 

BPI RD N I 



WR-DATA 



RAM-ADRESS 



XXX.CLK 

KERNEL_DATAJ 

BPI_AJ 

BPLCSJVJ 

BPLWR_N_I 

BPI_RD_N I 



WR-DATA 



RAM-ADRESS 



RAM INTERFACE - BUS SLOWER THAN KERNEL 
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12. Module Developers Test Cook Book 

This chapter is a copy of Hermann Obermeir's paper regarding this 
topic; contact him for updates ! 

Platform peripherals are prepared for ATPG and for isolated module test. The following gives 
a brief overview what a module developer has to do. 

12.1. Automatic Test Pattern Generation(ATPG) 

12.1.1. Prepare setup files 

e.g. GNUmakefile.setup 
INSERT_SCAN_DESIGN := true 
SCAN.CHAINS := <desired_number> 

12.1.2. Run check test 

- In Design Analyzer: Tools -> Test Synthesis: Check Design Analyzer 

- you should have no violations 

12.1.3. Insert Scan Registers and generate pattern 

Use "ssemake insert_scan" to insert scan registers. 
Build scan chains up to a maximum length of 100 flipflops. 

serial inputs of the scanchains: pdft_sci_(U, pdft_scM_i, 

serial outputs: pdft_sco_0_o, pdft_sco_1_o,.... 
scan enable input: pdft_scen_i 

12.1.4. Generate test Pattern 

"ssemake atpg" You should achieve 100% fault coverage. 

(If not make sure with a fault simulation of your module test pattern, that the untested faults are 
tested with your module test pattern) 

12.1.5. Save Results 

Save Results (Fault coverage.unvovered fauts) 

Save constraints for the ATPG tool(setJesLhold t setjest_assume), if they have been used. 



12.2. Preparation for isolated module test 

The platform peripherals are tested in an isolated module test. The peripheral signals are made 
transparent at the chip boundary e.g. using multiplexers. 
There are three categories of pins: 

FPI bus: the FPI bus is made transparent via EBC/EBU 

Alternate l/Os are made visible via normal prot functions 

Outputs to other modules ( here called intermoduleoutputs) are multiplexed in the ports 
Inputs from other modules are multiplexed in the peripherals (intermoduleinputs) 

12.2.1. Insert Testregister and Multiplex Inputs 

Usually 1 test register bit is required. Use .... as an example 

All InterModulelnputs are fed into a multiplexer and a testinput tim_.... is created, which will be 
connected to external pin on chip level 
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any_im_input = anyjntermodulejnput 
tim = prefix for "test isolatd module" 



tr_sU, jm_clock_tr_si, jm_test_reset_nj t jm_outen_tr_si are module inputs 
tr_so_o is a module output. They will be connected at chip level 

Further Cases: 

- No test register: Some modules (SSC, ASC, IIC) have no dedicated intermodule inputs. 
In this cases the test register may be omitted. Such execptions have to be approved 
bye Georg Sigl, because they reduce testability. 

- Addtional test register bits are necessary, if the module increases IDDQ-current (e.g. 
pullups, pulldowns) or the test register needs to be in a special mode for testing other 
peripherals (e.g. for clock generation) 

12.2.2. Insert MISR 

Insert Signature Registe Kann es je vorkommen, dass man das Signaturregister im Betrieb mit- 
laufen lassen moechte, dann braeuchten wir ein eigenes testregister Bit, ansosnter gemein- 
sam nit Eingansgsmultiplex 

12.2.3. Insert MISR 

You need a MISR, if your module has many inter module outputs (more than 5 to 10) 

- Generate MISR using genbist (length is number of your intermodule outputs, but mini- 
mum 20 

- add an asynchronous reset to the misr: jm_test_reset_i 

- the bcode inputs of the MISR are test pins to your module: pdft_misr_bcode[1:0] 

- the intermoduleoutputs are fed into the parallel MISR inputs 

- the msb ofthe misr will be a test 

- in test mode of the peripheral the MISR follows is controlled by the bcode test pins, in 
functional mode a synchronous reset is applied to bcode (bcode[1:0] ='10' 

12.2.4. Select Module Pattern 

The module pattern are used for a functional/performance test ofthe modul. 

select a set of PDL files and store them. 

Add a comment into your PDL file, where 

If ATPG could not achieve 100% fault coverage make these 



12.3. IDDQ Test preparation 

IDDQ test vectors will be selected for platform products using Viewlogic/Sunrise. All peripheral 
have to fulfill the testability rules for this tool. Its not yet determined, if we have to check every 
peripheral with Sunrise/testability checker. 



12.4. Fault Coverage 



12.5. Document Testability in Module Specification 

Document Testability in your Module Specification. Use SSC as an example 



13. Product Developers Test Cook Book 

tbd 

13.1. Preparations 
13.1.1. Port Description 

A description of the ports in computer readable format is necessary 

13.2. Make Top Level Entity 

13.2.1. Wiring of Test Register 

13.2.2. Wiring of Scan Registers 

13.3. Make Testbench 

The necessay testbench contains 



